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Preface 



SBIA, the Brazilian Symposium on Artificial Intelligence, is a biennial event 
intended to be the main forum of the AI community in Brazil. The SBIA 2004 
was the 17tlr issue of the series initiated in 1984. Since 1995 SBIA has been 
accepting papers written and presented only in English, attracting researchers 
from all over the world. At that time it also started to have an international 
program committee, keynote invited speakers, and proceedings published in the 
Lecture Notes in Artificial Intelligence (LNAI) series of Springer (SBIA 1995, 
Vol. 991, SBIA 1996, Vol. 1159, SBIA 1998, Vol. 1515, SBIA 2000, Vol. 1952, 
SBIA 2002, Vol. 2507). 

SBIA 2004 was sponsored by the Brazilian Computer Society (SBC). It was 
held from September 29 to October 1 in the city of Sao Luis, in the northeast 
of Brazil, together with the Brazilian Symposium on Neural Networks (SBRN). 
This followed a trend of joining the AI and ANN communities to make the joint 
event a very exciting one. In particular, in 2004 these two events were also held 
together with the IEEE International Workshop on Machine Learning and Signal 
Processing (MMLP), formerly NNLP. 

The organizational structure of SBIA 2004 was similar to other international 
scientific conferences. The backbone of the conference was the technical program 
which was complemented by invited talks, workshops, etc. on the main AI topics. 

The call for papers attracted 209 submissions from 21 countries. Each paper 
submitted to SBIA was reviewed by three referees. From this total, 54 papers 
from 10 countries were accepted and are included in this volume. This made 
SBIA a very competitive conference with an acceptance rate of 25.8%. The 
evaluation of this large number of papers was a challenge in terms of reviewing 
and maintaining the high quality of the preceding SBIA conferences. All these 
goals would not have been achieved without the excellent work of the members 
of the program committee - composed of 80 researchers from 18 countries - and 
the auxiliary reviewers. 

Thus, we would like to express our sincere gratitude to all those who helped 
make SBIA 2004 happen. First of all we thank all the contributing authors; 
special thanks go to the members of the program committee and reviewers for 
their careful work in selecting the best papers. Thanks go also to the steering 
committee for its guidance and support, to the local organization people, and to 
the students who helped with the website design and maintenance, the papers 
submission site, and with the preparation of this volume. Finally, we would like 
to thank the Brazilian funding agencies and Springer for supporting this book. 

Porto Alegre, September 2004 Ana L.C. Bazzan 

(Chair of the Program Committee) 
Sofiane Labidi 
(General Chair) 
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Abstract. We examine modal logical systems, with generalized opera- 
tors, for the precise treatment of vague notions such as ‘often’, ‘a mean- 
ingful subset of a whole’, ‘most’, ‘generally’ etc. The intuition of ‘most’ 
as “all but for a ‘negligible’ set of exceptions” is made precise by means 
of filters. We examine a modal logic, with a new modality for a local 
version of ‘most’ and present a sound and complete axiom system. We 
also discuss some variants of this modal logic. 

Keywords: Modal logic, vague notions, most, filter, knowledge repre- 
sentation. 



1 Introduction 

We examine modal logical systems, with generalized operators, for the precise 
treatment of assertions involving some versions of vague notions such as ‘often’, 
‘a meaningful subset of a whole’, ‘most’, ‘generally’ etc. We wish to express these 
vague notions and reason about them. 

Vague notions, such as those mentioned above, occur often in ordinary lan- 
guage and in some branches of science, some examples being “most bodies ex- 
pand when heated” and “typical birds fly”. Vague terms such as ‘likely’ and 
‘prone’ are often used in more elaborate expressions involving ‘propensity’, e.g. 
“A patient whose genetic background indicates a certain propensity is prone 
to some ailments”. A precise treatment of these notions is required for reason- 
ing about them. Generalized quantifiers have been employed to capture some 
traditional mathematical notions [2] and defaults [10]. A logic with various gen- 
eralized quantifiers has been suggested to treat quantified sentences in natural 
language [1] and an extension of first-order logic with generalized quantifiers for 
capturing a sense of ‘generally’ is presented in [5]. The idea of this approach 
is formulating ‘most’ as ‘holding almost universally’. This seems quite natural, 
once we interpret ‘most’ as “all, but for a ‘negligible’ set of exceptions”. 

Modal logics are specification formalisms which are simpler to be handled 
than first-order logic, due to the hiding of variables and quantifiers through the 
modal operators (box and diamond). In this paper we present a modal coun- 
terpart of filter logic, internalizing the generalized quantifier through a new 



A.L.C. Bazzan and S. Labidi (Eds.): SBIA 2004, LNAI 3171, pp. 1-10, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 




2 



Mario Benevides et al. 



modality V whose behavior is intermediate between those of the classical modal 
operators □ and <>• Thus one will be able to express “a reply to a message 
will be received almost always”: Vreceive-reply, “eventually a reply to a message 
will be received almost always”: (> Vreceive-reply, “the system generally operates 
correctly”: Vcorrect-operation, etc. 

An important class of problems involves the stable property detection. In 
a more concrete setting consider the following situation. A stable property is 
one which once it becomes true it remains true forever: deadlock, termination 
and loss of a token are examples. In these problems, processes communicate 
by sending and receiving messages. A process can record its own state and the 
messages it sends and receives, nothing else. 

Many problems in distributed systems can be solved by detecting global 
states. An example of this kind of algorithm is the Clrandy and Lamport Dis- 
tributed Snapshots algorithm for determining global states of distributed systems 
[6]. Each process records its own state and the two processes that a channel is 
incident on cooperate in recording the channel state. One cannot ensure that 
the states of all processes and channels will be recorded at the same instant, 
because there is no global clock, however, we require that the recorded process 
and channel states form a meaningful global state. The following text illustrates 
this problem [6]: “The state detection algorithm plays the role of a group of 
photographers observing a panoramic, dynamic scene, such as a sky filled with 
migrating birds - a scene so vast that it cannot be captured by a single photo- 
graph. The photographers must take several snapshots and piece the snapshots 
together to form a picture of the overall scene. The snapshots cannot all be taken 
at precisely the same instant because of synchronization problems. Furthermore, 
the photographers should not disturb the process that is being photographed; 
(...) Yet, the composite picture should be meaningful. The problem before us 
is to define ‘meaningful’ and then to determine how the photographs should be 
taken.” 

If we take the modality V to capture the notion of meaningful, then the 
formula S/cj) means: u (f> is true in a meaningful set of states”. Returning to the 
example of Clrandy and Lamport Algorithm, the formula: 

(V ( /\ (fa A ifj A Oij)) -> Dy) 

i,j=l..n 

would mean “if in a meaningful set of states, for each pair of processes i and j, 
the snapshot of process i’s local state has property fa, snapshot of process j has 
property ipj and the snapshot of the state of channel ij has property % , then it is 
always the case that global stable property 7 holds forever”. So we can express 
relationships among local process states, global system states and distributed 
computation’s properties even if we cannot perfectly identify the global state at 
each time; for the purpose of evaluating stable properties, a set of meaningful 
states that can be figured out from the local snapshots collected should suffice. 

Another interesting example comes from Game Theory. In Extensive Games 
with Imperfect Information (well defined in [9] ) , a player may not be sure about 
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the complete past history that has already been played. But, based on a mean- 
ingful part of the history he/she has in mind, lre/she may still be able to decide 
which action to choose. The following formula can express this fact 

□ ((turrij A V(^i A • • • A )) — > perform-action(a)) 

The formula above means: “it is always the case that, if it is player’s i turn and 
properties <j>i, ■ ■ ■ , </> n are true in a meaningful part of his/her history, then player 
i should choose action a to perform” . This is in fact the way many absent-minded 
players reason, especially in games with lots of turns like ‘War’, Chess, or even 
a financial market game. 

We present a sound and complete axiomatization for generalized modal logic 
as a first step towards the development of a modal framework for generalized 
logics where one can take advantage of the existing frameworks for modal logics 
extending them to these logics. 

The structure of this paper is as follows. We begin by motivating families, 
such as filters, for capturing some intuitive ideas of ‘generally’. Next, we briefly 
review a system for reasoning about generalized assertions in Sect. 3. In Sect. 
4, we introduce our modal filter logic. In Sect. 5 we comment on how to adapt 
our ideas to some variants of vague modal logics. Sect. 6 gives some concluding 
remarks. 



2 Assigning Precise Meaning to Generalized Notions 

We now indicate how one can arrive at the idea of filters [4] for capturing some 
intuitive ideas of ‘most’, ‘meaningful’, etc. Our approach relies on the familiar 
intuition of ‘most’ as “all but for a ‘negligible’ set of exceptions” as well as on 
some related notions. We discuss, trying to explain, some issues in the treatment 
of ‘most’, and the same approach can be applied in treating ‘meaningful’, ‘often’, 
etc. 

2.1 Some Accounts for ‘Most’ 

Various interpretations seem to be associated with vague notions of ‘most’. The 
intended meaning of “most objects have a given property” can be given either 
directly, in terms of the set of objects having the property, or by means of the 
set of exceptions, those failing to have it. In either case, a precise formulation 
hinges on some ideas concerning these sets. We shall now examine some proposals 
stemming from accounts for ‘most’. 

Some accounts for ‘most’ try to explain it in terms of relative frequency 
or size. For instance, one would understand “most Brazilians like football” as 
the “the Brazilians that like football form a ‘likely’ portion”, with more than, 
say, 75% of the population, or “the Brazilians that like football form a ‘large’ 
set”, in that their number is above, say, 120 million. These accounts of ‘most’ 
may be termed “metric”, as they try to reduce it to a measurable aspect, so 
to speak. They seek to explicate “most people have property <p” as “the people 
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having ip form a ‘likely’ (or ‘large’) set”, i.e. a set having ‘high’ relative frequency 
(or cardinality), with ‘high’ understood as above a given threshold. The next 
example shows a relaxed variant of these metric accounts. 

Example 1. Imagine that one accepts the assertions “most naturals are larger 
than fifteen” and “most naturals do not divide twelve” about the universe of 
natural numbers. Then, one would probably accept also the assertions: 

(V) “Most naturals are larger than fifteen or even” 

(A) “Most naturals are larger than fifteen and do not divide twelve” 

Acceptance of the first two assertions, as well as inferring (V) from them, 
might be explained by metric accounts, but this does not seem to be the case 
with assertion (A). A possible account for this situation is as follows. Both sets 
F, of naturals below fifteen, and T, of divisors of twelve, are finite. So, their 
union still form a finite set. 

This example uses an account based on sizes of the extensions: it explains 
“most naturals have property </?” as “the naturals failing to have p form a ‘small’ 
set”, where ‘small’ is taken as finite. Similarly, one would interpret “most reals 
are irrational” as “the rational reals form a ‘small’ set”, with ‘small’ now un- 
derstood as (at most) denumerable. This account is still quantitative, but more 
relaxed. It explicates “most objects have property p" as “those failing to have 
< p form a ‘small’ set”, in a given sense of ‘small’. 

As more neutral names encompassing these notions, we prefer to use ‘sizable’, 
instead of ‘large’ or ‘likely’, and ‘negligible’ for ‘unlikely’ or ‘small’. The previous 
terms are vague, the more so with the new ones. This, however, may be advanta- 
geous. The reliance on a - somewhat arbitrary - threshold is less stringent and 
they have a wider range of applications, stemming from the liberal interpretation 
of ‘sizable’ as carrying considerable weight or importance. Notice that these no- 
tions of ‘sizable’ and ‘negligible’ are relative to the situation. (In “uninteresting 
meetings are those attended only by junior staff”, the sets including only junior 
staff members are understood as ‘negligible’.) 



2.2 Families for ‘Most’ 

We now indicate how the preceding ideas can be conveyed by means of families, 
thus leading to filters [4] for capturing some notions of ‘most’. One can under- 
stand “most birds fly” as “the non-flying birds form a ‘negligible’ set”. This 
indicates that the intended meaning of “most objects have p" may be rendered 
as “the set of objects failing to have p is negligible”, in the sense that it is in 
a given family of negligible sets. The relative character of ‘negligible’ (and ‘siz- 
able’) is embodied in the family of negligible sets, which may vary according 
to the situation. Such families, however, can be expected to share some general 
properties, if they are to be appropriate for capturing notions of ‘sizable’, such 
as ‘large’ or ‘likely’. Some properties that such a family may, or may not, be 
expected to have are illustrated in the next example. 
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Example 2. Imagine that one accepts the assertions: 

(/3) “Most American males like beer” 

(a) “Most American males like sports” and 

(e) “Most American are Democrats or Republicans” 

In this case, one is likely to accept also the two assertions: 

(D) “Most American males like beverages” 

(fl) “Most American males like beer and sports” 

Acceptance of Q) should be clear. As for (n), its acceptance may be ex- 
plained by exceptions. (As the exceptional sets of non beer-lovers and of non- 
sports-lovers have negligibly few elements, it is reasonable to say that “negligibly 
few American males fail to like beer or sports”, so “most American males like 
beer and sports”.) In contrast, even though one accepts (e), neither one of the 
assertions “most American males are Democrats” and “most American males 
are Republicans” seems to be equally acceptable. 

This example hinges on the following ideas: if B C W and B has ‘most’ 
elements, then W also has ‘most’ elements; if both B and S have ‘negligibly few’ 
elements, then BUS will also have ‘negligibly few’ elements; a union DUR may 
have ‘most’ elements, without either D or R having ‘most’ elements. 

We now postulate reasonable properties of a family J\f C T(V) of negligible 
sets (in the sense of carrying little weight or importance) of a universe V . 

(C) JsATifXCJVeA, “subsets of negligible sets are negligible”. 

(0) 0 € A f, “the empty set 0 is negligible” . 

(V) V ^ A/", “the universe V is not negligible”. 

(U) N' U N" G U if TV', N" G A f, “unions of negligible sets are negligible”. 

These postulates can be explained by means of a notion of ‘having about the 
same importance’ [12]. Postulates (0) and ( V ) concern the non-triviality of our 
notion of ‘negligible’. Also, (U) is not necessarily satisfied by families that may 
be appropriate for some weaker notions, such as ‘several’ or ‘many’. In virtue 
of these postulates, the family A f of negligible sets is non-empty and proper as 
well as closed under subsets and union, thus forming an ideal. Dually, a family 
of sizable sets - of those having ‘most’ elements - is a proper filter (but not 
necessarily an ultrafilter [4]). 

Conversely, each proper filter gives rise to a non-trivial notion of ‘most’. Thus, 
the interpretation of “most objects have property ip ” as “the set of objects failing 
to have ip is negligible” amounts to “the set of objects having ip belongs to a 
given proper filter”. The properties of the family A f are intuitive and coincide 
with those of ideals. As the notion of ‘most’ was taken as the dual of ‘negligible’, 
it is natural to explain families of sizeable sets in terms of filters (dual of ideals) . 
So, generalized quantifiers, ranging over families of sets [1], appear natural to 
capture these notions. 

3 Filter Logic 

Filter logic extends classical first-order logic by a generalised quantifier V, whose 
intended interpretation is ‘generally’. In this section we briefly review filter logic: 
its syntax, semantics and axiomatics. 
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Given a signature p, we let L(p) be the usual first-order language (with equal- 
ity =) of signature p and use L v (p) for the extension of L(p) by the new opera- 
tor V. The formulas of L v (p) are built by the usual formation rules and a new 
variable-binding formation rule for generalized formulas: for each variable v, if 
<p is a formula in L v (p), then so is Vvip. 

Example 3. Consider a signature A with a binary predicate L (on persons). Let 
L(x,y ) stand for u x loves p”. Some assertions expressed by sentences of L V (A) 
are: “people generally love everybody” Vx\/yL(x , y), “somebody loves people in 
general” - 3 x\7yL(x,y), and “people generally love each other” - X7xVyL(x,y). 
Let L(x,y) be “y is taller than x" . We can express “people generally are taller 
than x ” by ’S7yL(x,y) and “x is taller than people in general” by 'S7yL(y,x). 

The semantic interpretation for ‘generally’ is provided by enriching first-order 
structures with families of subsets and extending the definition of satisfaction to 
the quantifier V. 

A filter structure A T = ( A , T) for a signature p consists of a usual structure 
A for p together with a filter T over the universe A of A. We extend the usual 
definition of satisfaction of a formula in a structure under assignment a to its 
(free) variables, using the extension A K [p{a, z)} := {b € A : A K |= <p(u, z)[a, 6]}, 
as follows: for a formula Vz<p(u,z), A K |= \/ zip{u, u)[a] iff -4 K [<p(a, z)\ is in 1C. 

As usual, satisfaction of a formula hinges only on the realizations assigned to 
its symbols. Thus, satisfaction for purely first-order formulas (without V) does 
not depend on the family of subsets. Other semantic notions, such as reduct, 
model {A K 1= r) and validity, are as usual [4, 7]. The behavior of V is interme- 
diate between those of the classical V and 3. 

A deductive system for the logic of ‘generally’ is formalized by adding axiom 
schemata, coding properties of filters, to a calculus for classical first-order logic. 
To set up a deductive system for filter logic one takes a sound and complete 
deductive calculus for classical hrst-orcler logic, with Modus Ponens (MP) as 
the sole inference rule (as in [7]), and extend its set A of axiom schemata by 
adding a set of new axiom schemata (coding properties of filters), to form 
A? := A L)<pf . This set & consists of all the generalizations of the following five 
schemata (where <p, if and 9 are formulas of L v (p)): 

[VV] Vutp — > Vutp; 

[V3] Vv<p — * 3u<p; 

[-» V] \/v(ip -^6)^ (Vvip -► V«0); 

[VA] (Vvip A Vv9) — > Vv(ip A 9); 

[V«] Vvip — > Vup[v := u\, for a new variable u not occurring in tp. 

These schemata express properties of filters, the last one covering alpha- 
betic variants. Other usual deductive notions, such as (maximal) consistent sets, 
witnesses and conservative extension [4, 7], can be easily adapted. So, filter deriv- 
ability amounts to first-order derivability from the filter schemata: r tp iff 
ruA? b ip. Hence, we have monotonicity of \~f and substitutivity of equivalents. 

This deductive system is sound and complete for filter logic, which is a proper 
conservative extension of classical first-order logic. It is not difficult to see that 
we have a conservative extension of classical logic: A h <p iff A \-f tp, for A and 
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<p without V. We have a proper extension of classical logic, because sentences, 
such as 3vS7 zu = z, cannot be expressed without V. 



4 Serial Local Filter Logic 

In this section, we examine modal logics to deal with vague notions. As pointed 
out in Sect. 1, these notions play an important role in computing, knowledge 
representation, natural language processing, game theory, etc. 

In order to introduce the main ideas, consider the following situation. Imagine 
we wish to examine properties of animals and their offspring. For this purpose, 
we consider a universe of animals and binary relation corresponding to “being 
an offspring of’ . Suppose we want to express “every offspring of a black animal 
is dark”; this can be done by the modal formula black — > Ddark. Similarly, 
black — * Odark expresses “some offsprings of black animals are dark” . Now, how 
do we express the vague assertion “most offsprings of black animals are dark” ? 
A natural candidate would be black — » Vdark, where V is the vague modality 
for ‘most’. Here, we interpret Vdark as “a sizable portion of the offsprings is 
dark”. Thus, V captures a notion of “most states among the reachable ones”. 
This is a local notion of vagueness. (In the FOL version, sorted generalized 
quantifiers were used for local notions.) One may also encounter global notions 
of vagueness. For instance, in “most animals are herbivorous”, ‘most’ does not 
seem to be linked to the reachable states (see also Sect. 6). 

The alphabet of serial local filter logic (SLF) is that of basic modal logic with 
a new modality V. The formulas are obtained by closing the set of formulas of 
basic modal logic by the rule: a := Va. 

Frames, models and rooted models of SLF are much as in the basic modal 
logic. For each s £ S, we denote by S R = {t € S : R(s,t)} the set of states in 
the frame that are accessible from s. Semantics of the V-operator is given by a 
family of filters { F s } se s , one for each state in a frame. A model of SLF is 4-tuple 
At = (S,R,F,V), where T = ( S,R ) is a serial frame (R is serial, i.e. , S R ^ 0, 
for all s £ S), V is a valuation, as usual, and F = {F s } se s with F s C 2 bR , a 
filter over S, for each s £ S. 

Satisfaction of a formula a in a rooted arrow model (At , s ) , denoted by 
At, s |= a, is defined as in the basic modal logic, with the following extra clause: 

At, s |= Va iff S R fl At [a] £ F s , 

with At = (S, R , { F s } se s , V) and M[p\ = {s £ S : At, s |= p} being the set of 
states that satisfies a formula ip in a model AL A formula a is a consequence of a 
set of formulas T in SLF, denoted by r |= a, when At, s \= T implies At, s |= a, 
for every rooted arrow model (At, s), as usual. 

A deductive system for SLF is obtained by extendind the deductive system 
for normal modal logic [14] with the axiom [OT] := <>T for seriality and the 
following modal versions of the axioms for filter first-order logic: 
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pVpp -> (S G F s ) 

[V0]V<p - 0? (0 £ F a ) 

[— » V]D(<y9 —y%/))—y (Ep — > Eif) (F s up-closed) 

[VA]V<^ A Exp — > V(<p A ip) (F s fl-closed) 

We write r b a to express that formula a is derivable from set r , in SLF. The 
notion of derivability is defined as usual, considering the rules of necessitation 
and Modus Ponens. 

Completeness 

It is an easy exercise to prove that the Soundness Theorem for SFL, i. e., b C |=. 
We now prove the Completeness Theorem for SLF, i. e., |= C H We use the 
canonical model construction. 

We start with the canonical model M c = (S c , R c , V c ) of basic modal logic 
[3] 1 . Since we have axiom [0T], model Ad c is a serial model 2 . 

It remains to define a family F c of filters over S c . For this purpose, we will 
introduce some notation and obtain a few preliminary results. 

Define [p\ = {E € S c : p G E}, e[p] = eRA [p], and K s = {e[p\ C S c : 
Ep G E}. 

Proposition 1. For every E G S c , (i) eR G Ke, (ii) 0 fL Ke, and (Hi) Ke is 
closed under intersection. 

Proof, (i) For all E G S c , T G E (as E is an MCS). Thus, r[T] = eR D 
[T] = eR- Given E G S c , by Necessitation and [EHV], we have VT G E. Thus 
eR G Ke yf 0- (ii) Assume 0 G Ke- Then, for some formula p, we have e[p\ = 0, 
i. e., Ep G E , for some p. By «>V), we have 0 p G E, i. e., there is some A G eR 
with p G E. But since e[p\ = 0, for all A G eR, p pP A, a contradiction, (iii) 
From [VA] we have e [ p\ D e [ ip\ = e(p A V’]- ■ 

As a result, each family Ke has the finite intersection property. Now, let Fe 
be the closure of Ke under supersets. Note that Fe is a proper filter over eR- 

Proposition 2. e[Q\ £ Ke iff s[0\ G F s . 

Proof. (=>) Clear. (<t=) Suppose e[0\ G Fe- Then, e[0\ A e[p\, for some 
e[<p\ G Ke (i- e., e[9\ A e[p\ and Ep G E , for some p.) Now, U(p — » 9) G E 3 . 
Thus, by [— > V] and Ep G E, we have Ed G E. Hence, e[0\ G Ke- ■ 

Define F c to be {Fe '■ E G S' 0 }. Define the canonical SLF model to be A4 F = 
(S c , R c , F c , V c ). Then we can prove the Satisfability Lemma (A4 F , E \= <j> iff 
(f) G A), by induction on formulas. Completeness is an easy corollary. 

1 S c is the set of maximal consistent sets of formulae. 

2 Recall that R C (P,A) iff Pp, p G A =» Op G T and r R = {A £ S c : R C (P,A)}. 
Also, given P G S c , if ()p G P, then there is some A G rR s. t. p G A [3]. 

3 If e[ 9\ A s[p\ G Ke, then □(<£ — + 9) G E, for if □(<£> — ► 9) 0 E, then there exists 
A G eR s. t. p — *• 9 p A. Thus p A -<0 G A (by consistency and maximality), i. e., 
p G A and 9 p A. Thus we have A G s[p\ C e[9\, as p G A and A G eR- Hence 
9 G A, a contradiction. 
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5 Variants of Vague Modal Logics 

We now comment on some variants of vague modal logics. 



Variants of Local Filter Logics. First note that the choice of serial models 
is a natural one, in the presence of [CIV] and [VO], i- e., [CIV], [VO] h Ocj) — » 
00, whence CT h 0T. An alternative choice would be non-serial local filter 
logics where one takes a filter over the extended universe S R° = {s} U S R, for 
each s € S and the corresponding axiom system {[— * V], [VA], [— > □], [CIO]} + 
{[□V'], [VO']}, where [CV'] := CIO A 0 — 1 ► VO, [VO 7 ] := VO — » 0 v 00> and 
[□0] := CO ~ * 00) M., s |= Va iff s R°PiM [a]. Soundness and completeness 

can be obtained in analogous fashion. 



Other Local Modal Logics. Serial local filter axioms encodes properties of 
filters through [— » V] - closed under supersets, [VA] - closed under intersections, 
and [V0] - non-emptyness axioms. Our approach is modular being easily adapted 
to encode properties of other structures, e. g., to encode families that are up- 
closed, one removes axiom [VA], to encode lattices one replaces [— > V] axiom 
by the [VV], where [W] := V0 A VO — > V(</> V ip). For those systems one 
obtains soundness and completeness results with respect to semantics of the 
V-operator being given by a family of up-closed sets and a family of lattices, 
respectively, along the same lines we provided for SLF logics. (In these cases, one 
takes Fz = Kz in the construction of the canonical model.) 

6 Conclusions 

Assertions and arguments involving vague notions, such as ‘generally’, ‘most’ 
and ‘typical’ occur often in ordinary language and in some branches of science. 
Modal logics are specification mechanisms simpler to handle than first-order 
logic. 

We have examined a modal logic, with a new generality modality for express- 
ing and reasoning about a local version of ‘most’ as motivated by the hereditary 
example in Sect. 4. We presented a sound and complete axiom system for local 
generalized modal logic, where the locality aspect corresponds to the intended 
meaning of the generalized modality: “most of the reachable states have the 
property”. (We thank one of the referees for an equivalent axiomatization for 
SLF. It seems that it works only for filters, being more difficult to adapt to other 
structures.) 

Some global generalized notions could appear in ordinary language, for in- 
stance; “most black animals will have most offspring dark”. The first occurrence 
of ‘most’ is global (among all animals) while the second is a local one (referring 
to most offspring of each black animal considered). In this case one could have 
two generalized operators: a global one, V, and a local one, V. Semantically V 
would refer to a filter (over the set of states) in a way analogous to the universal 
modality [8]. 
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Other variants of generalized modal logics occur when one considers multi- 
modal generalized logics as motivated by the following example. In a chess game 
setting, a state is a chessboard configuration. States can be related by different 
ways, depending on which piece is moved. Thus, one would have Q{s,t) for: t is 
a chessboard configuration resulting from a queen’s move (in state s), P(s, t) for: 
t is the chessboard configuration resulting from a pawn’s move (in state s), etc. 
This suggests having Vp, Vq, etc. Note that with pawn’s moves one can reach 
fewer states of the chessboard than with queen’s moves, i. e., {t G S : Q(s,t)} 
is (absolutely) large, while {t G S : P(s,t)} is not. Thus, we would have v q t 
holding in all states and ^V P T not holding in all states. On the other hand, 
among the pawn’s moves many may be good, that is: {f £ S': P(s, i), good(f)} 
is large within {feS: P(s,t)} (i. e. {t G S : P(s, t), good(f)} G Kf .) 

In this fashion one has a wide spectrum of new modalities and relations 
among them to be investigated. We hope the ideas presented in this paper pro- 
vide a first step towards the development of a modal framework for generalized 
logics where vague notions can be represented and be manipulated in a precise 
way and the relations among them investigated (e. g. relate important with very 
important, etc.). By setting this analysis in a modal environment one can further 
take advantage of the machinery for modal logics [3], adapting it to these logics 
for vague notions. 
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Abstract. The aim of this paper is to study a family of logics that ap- 
proximates classical inference, in which every step in the approximation 
can be decided in polynomial time. For clausal logic, this task has been 
shown to be possible by Dalai [4, 5]. However, Dalai’s approach cannot be 
applied to full classical logic. In this paper we provide a family of logics, 
called Limited Bivaluation Logics, via a semantic approach to approx- 
imation that applies to full classical logic. Two approximation families 
are built on it. One is parametric and can be used in a depth-first ap- 
proximation of classical logic. The other follows Dalai’s spirit, and with 
a different technique we show that it performs at least as well as Dalai’s 
polynomial approach over clausal logic. 



1 Introduction 

Logic has been used in several areas of Artificial Intelligence as a tool for mod- 
elling an intelligent agent reasoning capabilities. However, the computational 
costs associated with logical reasoning have always been a limitation. Even if 
we restrict ourselves to classical propositional logic, deciding whether a set of 
formulas logically implies a certain formula is a co-NP-complete problem [9] . 

To address this problem, researchers have proposed several ways of approx- 
imating classical reasoning. Cadoli and Sclraerf have proposed the use of ap- 
proximate entailment as a way of reaching at least partial results when solving 
a problem completely would be too expensive [13]. Their influential method is 
parametric, that is, a set S of atoms is the basis to define a logic. As we add more 
atoms to S, we get “closer” to classical logic, and eventually, when S contains all 
propositional symbols, we reach classical logic. In fact, Schaerf and Cadoli pro- 
posed two families of logic, intending to approximate classical entailment from 
two ends. The S 3 family approximates classical logic from below, in the follow- 
ing sense. Let V be a set of propositions and 0 C S' C S" C . . . C V\ let |=| 
indicate the set of the entailment relation of a logic in the family. Then: 

|=|c|=|, C|=!"C ...c\=^ = |= CL 
where CL is classical logic. 

* Partly supported by CNPq grant PQ 300597/95-5 and FAPESP project 03/00312-0. 
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Approximating a classical logic from below is useful for efficient theorem prov- 
ing. Conversely, approximating classical logic from above is useful for disproving 
theorems, which is the satisfiability (SAT) problem and has a similar formula- 
tion. In this work we concentrate only in theorem proving and approximations 
from below. 

The notion of approximation is also related with the notion of an anytime 
decision procedure, that is, an algorithm that, if stopped anytime during the 
computation, provides an approximate answer, that is, an answer of the form 
“up to logic Li in the family, the result is/is not provable”. This kind of anytime 
algorithms have been suggested by the proponents of the Knowledge Compilation 
approach [14, 15], in which a theory was transformed into a set of polynomially 
decidable Horn-clause theories. However, the compilation process is itself NP- 
complete. 

Dalai’s approximation method [4] was the first one designed such that each 
reasoner in an approximation family can be decided in polynomial time. Dalai’s 
initial approach was algebraic only. A model-theoretic semantics was provided 
in [5]. However, this approach was restricted to clausal form logic only. 

In this work, we generalize Dalai’s approach. We create a family of logics 
of Limited Bivalence (LB) that approximates full propositional logic. We pro- 
vide a model-theoretic semantics and two entailment relations based on it. The 
entailment |=^? is a parametric approximation on the set of formulas £ and fol- 
lows Cadoli and Schaerf’s approximation paradigm. The entailment |=^ B follows 
Dalai’s approach and we show that for clausal form theories, the inference )=^ B 
is polynomially decidable and serves as a semantics for Dalai’s inference l- PCP . 

This family of approximations is useful in defining families of efficiently de- 
cidable formulas with increasing complexity. In this way, we can define the set 
L\ = {a| |=); B a and k} of tractable theorems, such that A C A+i- 

This paper proceeds as follows. Next section briefly presents Dalai’s approxi- 
mation strategy, its semantics and discuss its limitations. In Section 3 we present 
the family LB(A) of Limited Bivaluation Logics; the semantics for full proposi- 
tional LB(A) is provided in Section 3.1; a parametric entailment is presented 
in Section 3.2. The entailment |=J: B is presented in Section 3.4 and the soundness 
and completeness of Dalai’s b PCP with respect to |=): B is Shown in Sections 3.3 
and 3.4. 

Notation: Let V be a countable set of propositional letters. We concentrate on 
the classical propositional language Cc formed by the usual boolean connectives 
— » (implication), A (conjunction), V (disjunction) and -i (negation). 

Throughout the paper, we use lowercase Latin letters to denote proposi- 
tional letters, a, (3 , 7 denote formulas, denote clauses and A denote a lit- 
eral. Uppercase Greek letters denote sets of formulas. By atoms(a) we mean 
the set of all propositional letters in the formula a; if £ is a set of formulas, 
atoms(A) = (J ae ^ atoms (a). 

Due to space limitations, some proofs of lemmas have been omitted. 




Towards Polynomial Approximations of Full Propositional Logic 



13 



2 Dalai’s Polynomial Approximation Strategy 

Dalai specifies a family of anytime reasoners based on an equivalence relation 
between formulas [4]. The family is composed of a sequence of reasoners I— o, l“i 
, . . such that each by is tractable, each h i+ i is at least as complete (with respect 
to classical logic) as bj, and for each theory there is a complete hi to reason with 
it. 

The equivalence relation that serves as a basis for the construction of a family 
has to obey several restrictions to be admissible, namely it has to be sound, 
modular, independent, irredundand and simplifying [4]. 

Dalai provides as an example a family of reasoners based on the classically 
sound but incomplete inference rule known as BCP (Boolean Constraint Propa- 
gation) [12], which is a variant of unit resolution [3]. For the initial presentation, 
no proof-theoretic or model-theoretic semantics were provided for BCP, but an 
algebraic presentation of an equivalence relation = BCP was provided. For that, 
consider a theory as a set of clauses, where a disjunction of zero literals is de- 
noted by f and the conjunction of zero clauses is denoted t. Let ->p denote the 
negation of the atom p, and let ~ ip be the complement of the formula ip obtained 
by pushing the negation inside in the usual way using De Morgan’s Laws until 
the atoms are reached, at which point = -<p and = p. 

The equivalence relation = BCP is then defined as: 

{f}ur= BCP {f} 

{A, ~ A V Ai V . . . V A„} U T = BCP {A, Ar V . . . V A„} U r 
where A, A j are literals. 

The idea is to use an equivalence relation to generate an inference in which 
if can be inferred from T if T U {~ip} is equivalent to an inconsistency. In this 
way, the inference h BCP is defined as T h BCP ip iff T U {~?/>} = BC p {f }- 

Dalai presents an example 1 in which, for the theory To = {p\J q,pW —iq, ->pV 
s V t,~<pV s V -<t} we both have To b BCP p and To,p h BCP s but To l/ BCP s. 

This example shows that h BCP is unable to use a previously inferred clause 
p to infer s. Based on this fact comes the proposal of an anytime family of 
reasoners. 



2.1 The Family of Reasoners 

BCP BCP B* 

Dalai defines a family of incomplete reasoners h 0 , hq , . . ., where each \~ k 
is given by the following: 



1 . 



Tk 



<P 



r u BCP 

<P 



2 . 



r , BCP , 



r,ipv-T y 



TK 



for \ip\ < k 






where the size of a clause ip, \ip\ is the number of literals it contains. 
1 This example is extracted from [5]. 
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The first rule tells us that every h BCP -inference is also a b fc -inference. The 
second rule tells us that if ip was inferred from a theory and it can be used as a 
further hypothesis to infer tp, and the size of ip is at most k, then tp is can also 
be inferred from the theory. 

Dalai shows that this is indeed an anytime family of reasoners, that is, for 

BCP • BCP • , BCP 

each k. b fc is tractable, b fc+1 is as complete as \~ k and if you remove the 
restriction on the size of ip in rule 2, then \~ k becomes complete, that is, for 
each classically inferable r b tp there is a k such that r b fc ° tp. 

2.2 Semantics 

In [5], Dalai proposed a semantics for b fc C based on the notion of k-valuations, 
which we briefly present here. 

Dalai’s semantics is defined for sets of clauses. Given a clause ip, the support- 
set of ip, S(ip) is defined as the set of all literals occurring in ip. Support sets 
ignore multiple occurrences of the same literal and are used to extend valuations 
from atoms to clauses. According to Dalai’s semantics, a propositional valuation 
is a function v : V — > [0, 1]; note that the valuation maps atoms to real numbers. 
A valuation is then extended to literals and clauses in the following way: 

1. v(->p) = 1 — v(p) for any atom p £ V. 

2. v(ip) = Ea S 5(^) u ( a )> for an y clause 

Valuations of literals are real numbers in [0, 1], but valuations of clauses are 
non-negative real numbers that can exceed 1. A valuation v is a model of ip 
if v(ip) > 1. A valuation is a countermodel of ip if v(ip) = 0. Therefore it is 
possible for a formula to have neither a model nor a countermodel. For instance, 
if v(j>) = v(q) = 0.2, then p V q has neither a model nor a countermodel. A 
valuation is a model of a theory (set of clauses) if it is a model of all clauses in 
it. 

Define r\a ip iff no model of the theory r is a countermodel of ip. 

Proposition 1 ([5]). For every theory r and every clause ip, r b BCP ip iff 
' ip . 

So b BCP is sound and complete with respect to |=a. The next step is to gener- 
alize this approach to obtain a semantics of b fc . For that, for any k > 0, a set 
V of valuations is a k -valuation iff for each clause ip of size at most k, if V has 
a non-model of ip then V has a countermodel of ip. V is a k-model of ip if each 
v £ V is a model of ip\ this notion extends to theories as usual. 

It is then possible to define rfck ip iff there is no countermodel of ip in any 
/c-model of T. 

Proposition 2 ([5]). For every theory r and every clause ip, r b fc C ip iff 
r^k ip. 

Thus the inference b fc is sound and complete with respect to («*,. 




Towards Polynomial Approximations of Full Propositional Logic 



15 



2.3 Analysis of \~ k 

Dalai’s notion of a family of anytime reasoners has very nice properties. First, 
every step in the approximation is sound and can be decided in polynomial 
time. Second, the approximation is guaranteed to converge to classical inference. 
Third, every step in the approximation has a sound and complete semantics, 
enabling an anytime approximation process. 

However, the method based on \~ k -approximations also has its limitations: 

1. It only applies to clausal form formulas. Although every propositional for- 
mula is classically equivalent to a set of clauses, this equivalence may not 
be preserved in any of the approximation steps. The conversion of a formula 
to clausal form is costly: one either has to add new propositional letters 
(increasing the complexity of the problem) or the number of clauses can be 
exponential in the size of the original formula. With regards to complexity, 
BCP is a form of resolution, and it is known that there are theorems that 
can be proven by resolution only in exponentially many steps [2]. 

2. Its non-standard semantics makes it hard to compare with other logics known 
in the literature, specially other approaches to approximation. Also, the se- 
mantics presented is based on support sets, which makes it impossible to 
generalize to non-clausaljprmulas. 

3. The proof-theory for \~ k is poor in computational terms. In fact, if we 
are trying to prove that T \~ k y, and we have shown that r l/ BCP tp, 
then we would have to guess a ip with \ip\ < k, so that T l- PCP ip and 
r, ip b fc ° <p. Since the BCP-approximations provide no method to guess the 
formula ip, this means that a computation would have to generate and test 
all the 0((2n) k ) possible clauses, where n is the number of propositional 
symbols occurring in r and ip. 

In the rest of this paper, we address problems 1 and 2 above. That is, we are 
going to present a family of anytime reasoners for the full fragment of propo- 
sitional logic, in which every approximation step has a semantics and can be 
decided in polynomial time. Problem 3 will be treated in further work. 

3 The Family of Logics LB(X') 

We present here the family of logics of Limited Bivalence , LB(U). This is a para- 
metric family that approximates classical logic, in which every approximation 
step can be decided in polynomial time. Unlike \~ k , LB(T) is parameterized 

by a set of formulas A; when £ contains all formulas of size at most k, LB(I7) 

. BCP 

can simulate an approximation step of b fc 

The family LB(U) can be applied to the full language of propositional logic, 
and not only to clausal form formulas, with an alphabet consisting of a countable 
set of propositional letters (atoms) V = {po,p\, . . .}, and the connectives A, 
V and — >, and the usual definition of well- formed propositional formulas; the set 
of all well-formed formulas is denoted by C. The presentation of LB is made in 
terms of a model theoretic semantics. 
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3.1 Semantics of LB(U) 

The semantics of LB(I7) is based of a three-level lattice, ( L , n, U, 0, 1), where L is 
a countable set of elements L = {0, 1, £o, £i, £ 2 , • • •}, LI is the least upper bound, 
n is the gratest lower bound, and C is defined, as usual, as a C b iff a U b = b iff 
a n b = a; 1 is the L-top and 0 is the C-bottom. L is subject to the conditions: 
(i) 0 C £> E 1, for every i < ui: and (ii) % £j for i ^ j. This three-level lattice 
is illustrated in Figure 3.1(a). 



1 




0 

(a) The 3-Level Lattice 



L 1 1 ) 

£0 £1 



I 

f\ 

1' 
1 ' 
1 ' 

I I 
\l 
0 



/ 1 



(b) The Converse Operation ~ 



This lattice is enhanced with a converse operation, ~, defined as: ~ 0 = 1, 
~1 = 0 and = £i for all i <u>. This is illustrated in Figure 3.1(b). 

We next define the notion of an unlimited valuation, and then we present 
its limitations. An unlimited propositional valuation is a function vs '■ V — > L 
that maps atoms to elements of the lattice. We extend vs to all propositional 
formulas, vs '■ C — > L, in the following way: 

vs(^a) =~vz{a) 
vs (a A P) = vs (a) n vs(( 3 ) 
vs (ct V ( 3 ) = vs (a) U vs {/ 3 ) 
vs (a —> P) = {~vs(a)) U vs(/ 3 ) 

A formula can be mapped to any element of the lattice. However, the formulas 
that belong to the set £ are bivalent, that is, they can only be mapped to the 
top or the bottom element of the lattice. Therefore, a limited valuation must 
satisfy the restriction of Limited Bivalence given by, for every a € C: 

vs (a) = 0 or Vs(ct) = 1, if a £ £. 

In the rest of this work, by a valuation vs we mean a limited valuation 
subject to the condition above. 

A valuation vs satisfies a if Vs(ct) = 1, and a is said satisfiable; a set of 
formulas r is satisfied by 1 >s if all its formulas are satisfied by Vs- A valuation 
Vs contradicts a if Vs(cd) = 0 ; if a is neither satisfied nor contradicted by vs, 
we say that Vs is neutral with respect to a. A valuation is classical if it assigns 
only 0 or 1 to all proposition symbols, and hence to all formulas. 

For example, consider the formula p — > q, and £ = 0. Then 

- if vs(p) = 1, then v s (p -> q) = vs{q)\ 

- if vs(p) = 0, then vs(p —><?) = 1; 
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- if vs(q) = 0, then vs(p — » q) = vs{p)\ 

- if vz(q) = 1, then vs(p — > q) = 1; 

- if vs(p) = £ p and Vs(q) = e q , then vs(p — > q) = 1; 

The first four valuations coincide with a classical behavior. The last one 
shows that if p and q are mapped to distinct neutral values, then p — > q will be 
satisfiable. Note that, in this case, p\/ q will also be satisfiable, and that p A q 
will be contradicted. 



3.2 LB-Entailment 

The notion of a parameterized LB-Entailment, |=ip follows the spirit of Dalai’s 
entailment relation, namely J |=ip a if it is not possible to satisfy J and con- 
tradict a at the same time. More specifically, J Y^s a if no valuation vs such 
that vs(r) = 1 also makes vs(a) = 0. Note that since this logic is not classic, 
if J |=if a and Vs(T) = 1 it is possible that the ccr is either neutral or satisfied 
by vs- 

For example, we reconsider Dalai’s example, where Jo = {p V q, p V -<q, —>p V 
sVt, -ipVsV-if} and make £ = 0. We want to show that r 0 bvp p, r 0 ,p |=vp s 
but Jq s - 

To see that Jo |=i? p, suppose there is a vs such that vs(p) = 0. Then we 
have Vs(p\/ q) = Vs{q) and vs(p\/ -<q) =~vs(q)- Since it is not possible to 
satisfy both, we cannot have us (Jo) = 1, so Jo |=i? p. 

To show that r 0 ,p |=if s, suppose there is a vs such that vs(s) = 0 and 
v s {p) = 1. Then vs{~^p V s V t) = vs(t) and vs{~^p V s V “if) =~vs{t). Again, 
it is not possible to satisfy both, so Jo,p f=i? s. 

Finally, to see that Jo Y^s s i take a valuation vs such that Vs(s) = 
0 ,vs(p) — £p,vs(q) = £q,vs(t) = £f Then ^(Jo) = 1 . However, if we make 
£ = {p} then we have only two possibilities for vs(p)- If vs(p) = 1, we have 
already seen that no valuation that contradicts s will satisfy Jo. If vs(p) = 0, we 
have also seen that no valuation that contradicts s will satisfy Jo . So for p £ £, 
we obtain Jo |=if s. 

This example indicates that beP behave in a similar way to b BCP , and that 
by adding an atom to £ we have a behavior similar to bj . We now have to 
demonstrate that this is not a mere coincidence. 



An Approximation Process. As defined in [8], a family of logics, parameter- 
ized with a set, £, is said to be an approximation of classical logic “from below” 
if, for increasing size of the parameter set £ we get closer to classical logic. That 
is, for 0 C £' C £" C ... C C we have that, 

b^ B c |=£ c |=LB, C ... c = bcL 



Lemma 1. The family of logics LB(J) is an approximation of classical logic 
from below. 
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Note that for a given pair (F, a) the approximation of F |= a can be done 
in a finite number of steps. In fact, if (3 , 7 £ £ any formula made up of f3 and 
7 has the property of bi valence. In particular, if all atoms of r and a are in £, 
then only classical valuations are allowed. 

An approximation method as above is not in the spirit of Dalai’s approxi- 
mation, but follows the paradigm of Cadoli and Schaerf [13,1], also applied by 
Massacci [11,10] and Finger and Wassermann [ 6 - 8 ]. 

We now show how Dalai’s approximations can be obtained using LB. 

3.3 Soundness and Completeness of b BCP with Respect to |=^ B 

For the sake of this section and the following, let f be a set of clauses and let 
ip and ip denote clauses, and A, Ai, A 2 , . . . denote literals. We now show that, for 

u = 0 , r b BCP ip iff r b 0 B ip- 

Lemma 2 . Suppose BCP transforms a set of clauses r into a set of clauses A, 
then v s (r ) = 1 iff v s (A) = 1. 

Lemma 3. r =bcp {f } iff for all valuations v 0 , u 0 (F) ^ 1. 

Theorem 1. Let r be a set of clauses and ip a clause. Then T b BCP ip iff 

r hk, B r 

Proof, r ]=k, B ip iff for no v 0 , u 0 (F) = 1 and v 0 (ip) = 0 iff for no u 0 , i> 0 (F U 
-iip) = 1 iff, by Lemma 3, F U -up = BC p {f} iff F b BCP ip. 

Lemma 4 (Deduction Theorem for b BCP ). Let. F be a set of clauses, A a 
literal and ip a clause. Then the following are equivalent statements: 

(a) F, A b BCP ip; (b) F b BCP ->A V ip; (c) F b BCP A ip. 

3.4 Soundness and Completeness of b fc 

As mentioned before, the family of entailment relations |=^P does not follow 
Dalai’s approach to approximation, so in order to obtain a sound and complete 
semantics for b fc we need to provide another entailment relation based on |=j?, 
which we call |=§ B . 

For that, let S be a set of sets of formulas and define F |=g B ip iff there 
exists a set £ £ S such that F |= l |? ip. We concentrate on the case where F 
is a set of clauses, ip is a clause and each £ e S is a set of atoms. We define 
S k = {£CV\ \£\ = k}. 

That is, Sfc is a set of sets of atoms of size k. Note that if we restrict our 
attention to n atoms, |Sfc | = ^J = 0(n k ) sets of k atoms. For a fixed k, we only 
have to consider a polynomial number of sets of k atoms. 

We then write to mean |=g B . 

Theorem 2. Let F be a set of clauses and ip a clause. Then F \~ k ip iff 

r K B r 
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Proof. (=>) By induction on the number of uses of rule 2 in the definition of 

BCP BCP 

\~ k . For the base case, Theorem 1 gives us the result. Assume that r b fc ip 

due to r b^ CP ip and P, p l- PCP ip. Suppose for contradiction that P y= k B ip, then 
for all PCP, |P| < k, there exists vs such that Vs(r) = 1 and Vs(ip) = 0. By 
the induction hypothesis, P ^): B ip, which implies Vs(p) yd 0, and P, ip ^|: B p, 
which implies Vs(p) yd 1. So v a (p) = for some i < u>, which implies that 
atoms(y) n P = 0, but this cannot hold for all P, a contradiction. So P |=J: B ip. 

(-<=) Suppose P |=): B ip. Then for some P with |P| < k, P ^^ B ip and 
suppose that P is a smallest set with such property. Therefore, for all with vs 
with Vs(r) = 1 we have Vs(ip) yd 0. Choose one such vs and define the set of 
literals A = {A is a literal whose atom is in P|uj;(A) = 1}. 

We first show that P (=^f A for every A € A. Suppose for contradiction that 
for some A € A, P y= L ® A, then there is a v' s with v' s (r) = 1 and v' s (ip) yd 0 
but v' s (X) = 0. Let atoms(A) = {p}. If p does not occur in ip, then P |=i; B -{p} ip, 
which contradicts the minimality of P. So ip = p V or ip = ~<P V x" ■ Consider 
a Vs~{ P } such that Vs-{ P }(r) = 1; if V£-{ p } maps p to 0 or 1 it is a Vs, 
so vs-{ p }(ip) yf 0; if Vs-{ P }(p) = £i for some i, then clearly we have that 
V s-{ P } ('*/’) / 0, so P l=i; B -{p} which contradicts the minimality of P. It 
follows that P |=^ B A. 

We now show that PUA b BCP ip. Suppose for contradiction that PUA I / BCP ip. 
Then, by Theorem 1, PUA ip 7 that is, there exists V 0 such that tJ0(PUA) = 1 
and i) 0 (ip) = 0. However, such V 0 maps all atoms of P to 0 or 1, so it is actually 
aoj that contradicts P ip. So PUd b BCP ip. 

If P b BCP ip then clearly P \~ k ip. So suppose P l/ BCP ip. In this case, we 
show that P \~ k f\ A. Let A = { Ai , . . . , A m }, we prove by induction that for 
1 < i < m, r, Ai, . . . , Aj_i b BCP A j. From P l/ BCP ip and Theorem 1 we know that 
there is a valuation V 0 such that v&(r) = 1 and Vz(ip) = 0. From PUd b BCP ip 
we infer that there must exist a A £ A such that ^^(A) yd 1; without loss of 
generality, let A = A m . Suppose for contradiction that P, Ai, . . . , \ m -i Vbcp 
A m . Then there exists a valuation v ' 0 such that v' 0 (r, Ai,...,A m _i) = 1 but 
v ' 0 (\m) = 0, which contradicts P A. So P, Ai, . . . , A m _i b BCP A m . 

Now note that for 1 < i < m, P, Aj, . . . , A m l/ BCP ip, otherwise the minimality 
of P would be violated. From Theorem 1 we know that there is a valuation v 0 
such that v 0 (r, A A m ) = 1 and v 0 (ip) = 0. From P U A b BCP ip we infer 
that there must exist a A G {Ai, . . . , A^_i } such that v 0 (X) yd 1; without loss 
of generality, let A = A;_i. Suppose for contradiction that P, Ai, . . . , Xi -2 l/ BCP 
Ai_i. Then there exists a valuation v ' 0 such that v 0 (r, Ai, . . . , Xi-2) = 1 but 
v' 0 (Xi-i) = 0, but this contradicts P |= l yP A. So P, Ai, . . . , Xi -2 ^bcp A,_i. 

Thus we have that P b BCP Ai; P, Ai b BCP A 2 ; . . . ; P, Ai, . . . , A m _i b BCP X m . It 

BCP BCP 

follows that P \~ k f\A as desired. Finally, from P U A b BCP ip and P \~ k /\ A 

we obtain that P b fc ip, and the result is proved. 

The technique above differs considerably from Dalai’s use of the notion of 
vividness. It follows from Dalai’s result that each approximation step |=|: B is 
decidable in polynomial time. 
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4 Conclusions and Future Work 

In this paper we presented the family of logics LB (If) and provided it with a 

lattice-based semantics. We showed that it can be a basis for both a parametric 

and a polynomial clausal approximation of classical logic. This semantics is sound 

and complete with respect to Dalai’s polynomial approximations \~ k 

Future work should extend polynomial approximations to non-clausal logics. 

It should also provide a proof-theory for these approximations. 
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Abstract. One of the main problems in using logic for solving problems 
is the high computational costs involved in inference. In this paper, we 
propose the use of a notion of relevance in order to cut the search space 
for a solution. Instead of trying to infer a formula a directly from a large 
knowledge base K, we consider first only the most relevant sentences in 
K for the proof. If those are not enough, the set can be increased until, 
at the worst case, we consider the whole base K. 

We show how to define a notion of relevance for first-order logic with 
equality and analyze the results of implementing the method and testing 
it over more than 700 problems from the TPTP problem library. 

Keywords: Automated theorem provers, relevance, approximate rea- 
soning. 



1 Introduction 

Logic has been used as a tool for knowledge representation and reasoning in 
several subareas of Artificial Intelligence, from the very beginning of the field. 
Among these subareas, we can cite Diagnosis [1], Planning [2], Belief Revision 
[3], etc. 

One of the main criticisms against the use of logic is the high computational 
costs involved in the process of making inferences and testing for consistency. 
Testing satisfiability of a set of formulas is already an NP-complete problem 
even if we stay within the realms of propositional logic [4]. And propositional 
logic is usually not rich enough for most problems we want to represent. Adding 
expressivity to the language comes at the cost of adding to the computational 
complexity. 

In the area of automatic theorem proving [5] , the need for heuristics that help 
on average cases has long been established. Recently, there have been several 
proposals in the literature of heuristics that not only help computationally, but 
are also based on intuitions about human reasoning. In this work, we concentrate 
on the ideas of approximate reasoning and the use of relevance notions. 

Approximate reasoning consists in, instead of attacking the original problem 
directly, performing some simplification such that, if the simplified problem is 



A.L.C. Bazzan and S. Labidi (Eds.): SBIA 2004, LNAI 3171, pp. 21—30, 2004. 
(c) Springer- Verlag Berlin Heidelberg 2004 
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solved, the solution is also a solution for the original problem. If no solution is 
found, then the process is restarted for a problem with complexity lying between 
those of the original and the simplified problem. 

That is, we are looking for a series of deduction mechanisms {hi, I— 2 , 
with b, computationally less expensive than \~j for i < j, such that if Th(\~i) 
represents the theorems which can be proved using bj, and b is a sound and 
complete deduction mechanism for classical logic, we get: Th(\~i) C T/i(b 2 ) C 
. . . C Th{}r) 

An example of such kind of system is Schaerf and Cadoli’s “Approximate 
Entailment” [6] for propositional logic. The idea behind their work is that at 
each step of the approximation process, only some atoms of the language are 
considered. 

Given a set S of propositional letters, their system S 3 disconsiders those 
atoms outside S by allowing both p and ->p to be assigned the truth value 1 
when p is not in S. If p is in S, then its behavior is classic, i.e., p is assigned 
the truth value 1 if and only if is assigned 0. The system S 3 is sound but 
incomplete with respect to classical logic. This means that for any S , if a formula 
is an S 3 consequence of a set of formulas, it is also a classical consequence. Since 
the system is incomplete, the fact that a formula does not follow from the set 
according to S 3 does not give us information about its classical status. 

There are several other logical systems found in the literature which are also 
sound and incomplete, such as relevant [7] and paraconsistent logics [8]. 

In this work, we present a sound an incomplete system based on a notion of 
relevance. We try to prove that a sentence a follows from a set of formulas K by 
first considering only those elements of K which are most relevant to a. If this 
fails, we can add some less relevant elements and try again. In the worst case, 
we will end up adding all the elements of K , but if we are lucky, we can prove 
a with less. 

The system presented here is based on the one proposed in [9]. The original 
framework was developed for propositional logic. In this paper, we extend it to 
deal with first order logic and show some empirical results. 

The paper proceeds as follows: in the next section, we present the idea of 
using a relevance graph to structure the knowledge base, proposed in [9]. In 
Section 3, we introduce a particular notion of relevance, which is based purely 
on the syntactical analysis of the knowledge base. In Section 4, we show how 
these ideas were implemented and the results obtained. We finally conclude and 
present some ideas for future work. 

2 The Relevance Graph Approach 

In this section, we assume that the notion of relevance which will be used is given 
and show some general results proven in [9]. In the next section, we consider a 
particular notion of relevance which can be obtained directly from the formulas 
considered, without the need of any extra-logical resources. 

Let 1Z be a relation between two formulas with the intended meaning that 
lZ(a, (3 ) if and only if the formulas a and f3 are directly relevant to each other. 




Using Relevance to Speed Up Inference 



23 



Given such a relatedness relation, we can represent a knowledge base (a set of 
formulas) as a graph where each node is a formula and there is an edge between 
p and if if and only if T Z(p,if). This graph representation gives us immediately 
a notion of degrees of relatedness: the shorter the path between two formulas 
of the base is, the closer related they are. Another notion made clear is that 
of connectedness: the connected components partition the graph into unrelated 
“topics” or “subjects” . Sentences in the same connected component are somehow 
related, even if far apart (see Figure 1). 





Fig. 1. Structured Knowledge Base 



Fig. 2. Degrees of Relevance 



Definition 1. [9] Let K be a knowledge base and TZ be a relation between 
formulas. A TZ-path between two formulas p and if in K is a sequence P = 
(po,pi, ...,p n ) of formulas such that: 

1. pq = p and p n = ip 

2. {px, ...,p n - 1 } C K 

3. TZ(p,pi), TZ(p 1 ,p 2 ), andTZ(p n ,if). 

If it is clear from the context to which relation we refer we will talk simply 
about a path in K . 

p 

We represent the fact that P is a path between p and if by p if. 

The length of a path P = (po, pi , ..., p n ) Is l(P) = n 

Note that the extremities of a path in K are not necessarily elements of K. 

Definition 2. [9] Let K be a knowledge base and TZ a relation between formulas 

of the language. We say that two formulas p and if are related in K by TZ if and 

P 

only if there is a path P such that p^if. 

Given two formulas p and if and a base K , we can use the length of the 
shortest path between them in K as the degree of unrelatedness of the formulas. 
If the formulas are not related in K , the degree of unrelatedness is set to infinity. 
Formulas with a shorter path between them in K are closer related in K . 
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Definition 3. [9] Let K be a knowledge base, TZ a relation between formulas of 
the language and <p and if formulas. The unrelatedness degree of p and if in K 
is given by: 

{ 0 if p = if and p £ K 

min{l(P)\p if} if :p and if are related in K by 1Z 
oo otherwise 

We now show, given the structure of a knowledge base, how to retrieve the 
set of formulas relevant for a given formula a: 

Definition 4. [9] The set of formidas of K which are relevant for a with degree 
i is given by: 

A 1 (a, I\) ={(/?£ K\u(a, ip) = i} for i > 0 

Definition 5. [9] 

The set of formulas of K which are relevant for a up to degree n is given by: 
A- n (a, I<) = U 0 <i<n K ) f° r n >° 

We say that A- u {a,K) = IJj>o A l (a,K) is the set of relevant formulas 
for a. 

In Figure 2, we see an example of a structured knowledge base K = {a, (3, 7, 
S, s, 77 , 6 , l, k, A, p, v, o, 7 r, p, a , <p, ip , %}• The dotted circles represent different 
levels of relevance for a. We have: 

A°(a,K) = {a} 

A 1 (a, K) = {/?,x,5,e} 

A 2 (a, I<) = {7, 7 ?, t, ip, k, A, p, <f} 

A 3 (a,K ) = {v, o,n,0, p, cr} 

A^ u (a, K ) = A°(a, K) U A^a, K) U A 2 (a , K) U A 3 (a , K) = K 
We can now define our notion of relevant inference as: 

Definition 6. K h, a if and only if A- l (a, K) b a 

Since A- q ’(a,K) is a subset of K, it is clear that if for any i, K bj a, then 
K b a. Note however that if K l/j a, we cannot say anything about whether 
K b a or not. 

An interesting point of the framework above is that it is totally indepen- 
dent on which relevance relation is chosen. In the next section, we explore one 
particular notion of relevance, which can be used with this framework. 



3 Syntactical Relevance 

We have seen that, given a relevance relation, we can use it to structure a 
set of formulas so that the most relevant formulas can be easily retrieved. But 
where does the relevance relation come from? Of course, we could consider very 
sophisticated notions of relevance. But in this work, our main concern is to find 
a notion that does not require that any extra information is added to the set I\ . 
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In [9], a notion of syntactical relevance is proposed (for propositional logic), 
which makes lZ(a, (3) if and only if the formulas a and (3 share an atom. It can 
be argued that this notion is very simplistic, but it has the advantage of being 
very easy to compute (this is the relation used in Figure 1). We can also see that 
it gives intuitive results. Consider the following example, borrowed from [10] 1 . 

Example 1. Consider Paul, who is finishing school and preparing himself for the 
final exams. He studied several different subjects, like Mathematics, Biology, 
Geography. His knowledge base contains (among others) the beliefs in Figure 3. 

When Paul gets the exam, 
the first question is: Do cows 
have molar teeth? 

Of course Paul cannot rea- 
son with all of his knowledge 
at once. First he recalls what 
he knows about cows and about 
molar teeth: 

Cows eat grass. 

Mammals have canine teeth 
or molar teeth. 

From these two pieces of 
knowledge alone, he cannot an- 
swer the question. Since all he 
knows (explicitly) about cows is 
that they eat grass, he recalls 
what he knows about animals 
that eat grass: 

Animals that eat grass do not have canine teeth. 

Animals that eat grass are mammals. 

From these, Paul can now derive that cows are mammals, that mammals 
have canine teeth or molar teeth, but that cows do not have canine teeth, hence 
cows have molar teeth. 

The example shows that usually, a system does not have to check its whole 
knowledge base in order to answer a query. Moreover, it shows that the process 
of retrieving information is made gradually, and not in a single step. If Paul had 
to go too far in the process, he would not be able to find an answer, since the 
time available for the exam is limited. But this does not mean that if he was 
given more time later on, he would start reasoning from scratch: his partial (or 
approximate) reasoning would be useful and he would be able to continue from 
more or less where he stopped. 

Using the syntactical notion of relevance, the process of creating the relevance 
graph can be greatly simplified. The graph can be implemented as a bipartite 
graph, where some nodes are formulas and some are atoms. The list of atoms 
is organized in lexicographic order, so that it can be searched efficiently. For 
every formula which is added to the graph, one only has to link it to the atoms 

1 The example is based on an example of [6]. 



Triangles are polygons. 

Triangles with one right angle are Pythagorean. 
Rectangles are polygons. 

Rectangles have four right angles. 

Cows eat grass. 

Dogs are carnivore. 

Animals that eat grass do not have canine teeth. 
Carnivorous animals are mammals. 

Mammals have canine teeth or molar teeth. 
Animals that eat grass are mammals. 

Mammals are vertebrate. 

Vertebrates are animals. 

Brazil is in South America. 

Volcanic soil is fertile. 



Fig. 3. Student’s knowledge base 
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occurring in it. In this way, it will be automatically connected to every other 
formula with which it shares an atom. 

This notion of relevance gives us a “quick and dirty” method for retrieving 
the most relevant elements of a set of formulas. 

Epstein [11] proposes some desiderata for a binary relation intended to rep- 
resent relevance. Epstein’s conditions are: 

R1 - 7 Z(<p, <p) 

R2 - 7 Z(ip,ip) i S lZ(-xp,if>) 

R3 - lZ(<p, ip) iff 7 Z(ip, <p) 

R4 - 7 Z(<p, 7 — > ip) iff 7 Z(ip, 7 ) or 7 Z(tp, ip) 

R5 - 7 Z(ip, 7 Aip) iff 7 Z(ip, j ip). 

It is easy to see that syntactical relevance satisfies Epstein’s desiderata. More- 
over, Rodrigues [12] has shown that this is actually the smallest relation satis- 
fying the conditions given in [ 11 ]. 

Unfortunately, propositional logic is very often not enough to express many 
problems found in Artificial Intelligence. We would like to move to first-order 
logic. As is well known, this makes the inference problem much harder. On the 
other hand, having a problem which is hard enough is a good reason to abandon 
completeness and try some heuristics. 

In what follows, we adapt the definition of syntactical relevance relation to 
deal with full first-order logic with equality. 

Definition 7. Let a be a formula. Then C(a) is the set of non-logical constants 
(constants, predicate, and function names) which occur in a. 

Definition 8 (tentative). Let p be a binary relation defined as: 

p(a, (3) if and only if C(a) D C(/ 3) ^ 0 

It is easy to see that this relation satisfies Epstein’s desiderata. 

One problem with the definition above is that we very often have predicates, 
functions or constants that appear in too many formulas of the knowledge base, 
and could make all formulas seem relevant to each other. In this work, we consider 
one such case, which is the equality predicate (~). 

Based on the work done by Epstein on relatedness for propositional logic, 
Krajewski [13] has considered the difficulties involved in extending it to first- 
order logic. He notes that the equality predicate should be dealt with in a differ- 
ent way and presents some options. The option we adopt here is that of handling 
equality as a connective, i.e. , not considering it as a symbol which would con- 
tribute for relevance:. 

Definition 9. Let p be a binary relation defined as: 

p(a, (3) if and only if ( C(a ) fl C((3)) \ {^} ^ 0 

We can now use p as the relatedness relation needed to structure the relevance 
graph, and instantiate the general framework. 

In the next section, we describe how this approximate inference has been im- 
plemented and some results obtained, which show that the use of the relatedness 
relation p does bring some gains in the inference process. 
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4 Implementation and Results 



In this section, we show how the framework for approximate inference based on 
syntactical relevance has been implemented and the results which were obtained. 

The idea is to have the knowledge base structured by the relatedness relation 
p and to use breadth- first search in order to retrieve the most relevant formulas. 
The algorithm receives as input the knowledge base K , the formula a which we 
are trying to prove, the relation p, a global limit A of resources (time, memory) 
for the whole process, a local limit A which will be used at each step of the ap- 
proximation process, an inference engine /, which will be called at each step and 
a function H which decides whether it is time to move to the next approximation 
step. 

The basic algorithm is as follows: 



Input: K,a,p , A (Global limit of resources), A (Local limit of resources), I 
(inference engine, returns Yes, No, or Fail), H (function that decides whether to 
apply next inference step). 

Output: Yes, No or Fail. 

Data Structures: Q (a queue), K l (a subset of K) 

1. * = 1; Q = 0; K* = 0. 

2. Enqueue^, a); Mark(a); 

3. While ( Q yf 0) and (used resources < A) do 

3.1. (3 = Dequeue(Q); 

3.2. For all ip such that p((3, ip) and <p is not marked, do 

3.2.1. Enqueue (Q,<p); Mark(cp); 

3.2.2. K' = K l U {<p}; 

3.2.3. If H(K,K\i,a) then 

3. 2. 3.1 If I(K l ,a, A) = Yes, then return Yes; 

3. 2.3. 2 i = i + l\ K i = K ( - i ~ 1 '>- 

4. If used resources < A then return I(K l , a , A); 

else return Fail. 



In our tests, the inference engine used (the function I) was the theorem prover 
OTTER [14]. OTTER is an open-source theorem prover for first-order logic 
written in C. The code and documentation can be obtained from http://www- 
unix.mcs.anl.gov/AR/OTTER. OTTER was modified so that it could receive as 
a parameter the maximum number of sentences to be considered at each step. 
It was also modified to build the relevance graph after reading the input file. 
We call the modified version RR-OTTER (Relevance- Reasoning OTTER). The 
algorithm was implemented in C and the code and complete set of tests are 
available in [15]. 

The function H looks at the number of formulas retrieved at each step. At 
the first step, only the 25 most relevant formulas are retrieved, i.e., for i = 1, 
when \K l \ = 25 , H returns true. 
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In order to test the algorithm, two knowledge bases were created, putting 
together problems of the TPTP 2 (Thousands of Problems for Theorem Provers) 
benchmark. Base 1 was obtained by putting together the axioms of the problems 
in the domains “Set theory” , “Geometry” , and “Management” , and it contained 
1029 clauses. Base 2 was obtained adding to Base 1 the axioms of the problems in 
“Group Theory”, “Natural Language Processing”, and “Logic Calculi”, yielding 
1781 clauses. Only problems in which the formula was a consequence of the base 
were considered. 

Two sets of tests were run. The first one (Tests 1) contained 285 problems 
from the “Set Theory” domain, and used as the knowledge base Base 1 described 
above. The function H was set to try to solve the problems with 25, then 50, 
100, 200, 250, 300, 350, 400, 450, 500, 550, and 600 clauses at each step. For 
each step, the maximum time allowed was 12.5 seconds. This gives a global time 
limit of 150 seconds. 

The second set of tests (Tests 2) contained 458 problems from the “Group 
Theory” domain and used Base 2. It was tested with 25, 50, 100, 200, 250, 300, 
350, 400, 450, and 500 clauses at each step, with the time limit at each step 
being 15 seconds. Again, the global limit was 150 seconds. 

In order to compare the results obtained, each problem was also given to the 
original implementation of OTTER, with the time limit of 150 seconds. 

The table below shows the results for six problems from the set Tests 1. 



Problem 


Time OTTER (s) 


Time RR-OTTER (s) 


# of sentences used 


SET003-1 


- 


13.06 


50 


SET018-1 


- 


63.21 


300 


SET024-6 


0.76 


12.96 


50 


SET031-3 


0.71 


0.45 


25 


SET183-6 


- 


98.46 


400 


SET296-6 


0.74 


38.08 


200 



We can see that for the problems SET003-1, SET018-1, and SET183-6, which 
OTTER could not solve given the limit of 150 seconds, RR-OTTER could find 
a solution, considering 50, 300 and 400 clauses respectively. In this cases, it is 
clear that limiting the attention to relevant clauses brings positive results. For 
problem SET031-3, the heuristic proposed did not bring any significant gain. 
And for problems SET024-6 and SET296-6, we can see that OTTER performed 
better than RR-OTTER. These last two problems illustrate the importance of 
choosing a good function H . Consider problem SET024-6. RR.-OTTER spent the 
first 12.5 seconds trying to prove it with 25 clauses and failed. Once it started 
with 50 clauses, it took 0.46 seconds. The same happened in problem SET296-6, 
where the first 37.5 seconds were spent with unsuccessful steps. 

The following is a summary of the results which were obtained: 



http:/ /www.tptp.org/ 
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Tests 1 (285 problems) 


Tests 2 (458 problems) 


Solutions found by OTTER 


111 


212 


Solutions found by RR-OTTER 


196 


258 


Average time 1 OTTER 


93 sec 


128 sec 


Average time 1 RR-OTTER 


61 sec 


138 sec 


Average time 2 OTTER 


3.04 sec 


11.6 sec 


Average time 2 RR-OTTER 


6.9 sec 


23.07 sec 



We can see that, given the global limit of 150 seconds, RR-OTTER solved 
more problems than the original OTTER. The lines “Average time 1” consider 
the average time for all the problems, while “Average time 2” takes into account 
only those problems in which the original version of OTTER managed to find a 
solution. 

An interesting fact which can be seen from the tests is the influence of a bad 
choice of function H. For the problems in Tests 1, if we had started with 50 
sentences instead of 25, the Average time 2 of RR-OTTER would have been 3. 1 
instead of 6.9 (for the whole set of results, please refer to [15]). 

As it would be expected, when RR-OTTER manages to solve a problem 
considering only a small amount of sentences, the number of clauses it generates 
is much lower than what OTTER generates, and therefore, the time needed is 
also shorter. As an example, the problem SET044-5 is solved by RR-OTTER 
at the first iteration (25 sentences) in 0.46 seconds, generating 29 clauses, while 
OTTER takes 9.8 seconds and generates 3572 new clauses. This shows that, at 
least for simple problems, the idea of restricting attention to relevant sentences 
helps to avoid the generation of more irrelevant data and by doing so, keeps the 
search space small. 

5 Conclusions and Future Work 

In this paper, we have extended the framework proposed in [9] to deal with 
first-order logic and showed how it can be used to perform approximate theorem 
proving. 

The method was implemented, using the theorem prover OTTER. Although 
the implementation is still naive, we could see that in many cases, we could obtain 
some gains. The new method, RR-OTTER, managed to solve some problems 
that OTTER could not prove, given a time limit. 

The tests show that the strategy of considering the most relevant sentences 
first can be fruitful, by keeping the search space small. 

Future work includes more tests in order to better determine the parameters 
of the method, such as the function H , and improving the implementation. 
Instead of external calls to OTTER, we plan to use otterlib [16], a C library 
developed by Flavio Ribeiro. The idea is that we could then keep the inference 
state after each step of the approximation (for example, all the clauses that were 
generated), instead of restarting from scratch. 
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Abstract. The use of rewriting systems to transform a given expression 
into a simpler one has promoted the use of rewriting logic in several areas 
and, particularly, in Software Engineering. Unfortunately, this applica- 
tion has not reached the treatment of Functional Dependencies contained 
in a given relational database schema. The reason is that the different 
sound and complete axiomatic systems defined up to now to manage 
Functional Dependencies are based on the transitivity inference rule. In 
the literature, several authors illustrate different ways of mapping infer- 
ence systems into rewriting logics. Nevertheless, the explosive behavior of 
these inference systems avoids the use of rewriting logics for classical FD 
logics. In a previous work, we presented a novel logic named SL FD whose 
axiomatic system did not include the transitivity rule as a primitive rule. 
In this work we consider a new complexity criterion which allows us 
to introduce a new minimality property for FD sets named atomic- 
minimality. The SL fd logic has allowed us to develop the heart of this 
work, which is the use of Rewriting Logic and Maude 2 as a logical 
framework to search for atomic-minimality. 

Keywords: Knowledge Representation, Reasoning, Rewriting Logic, 
Redundancy Removal 



1 Introduction 

E.F. Cood introduces the Relational Model [1] having both, a formal framework 
and a practical orientation. Cood’s database model is conceived to store and 
to manage data in an efficient and smart way. In fact, its formal basis is the 
main reason of their success and longevity in Computer Science. In this formal 
framework the notion of Functional Dependency (FD) plays an outstanding role 
in the way in which the Relational Model stores, recovers and manages data. 

FDs were introduced in the early 70’s and, after an initial period in which 
several authors study in depth their power, they fell into oblivion, considering 
that the research concerning them had been completed. Recently, some works 
have proved that there is still a set of FDs problems which can be revisited in a 
successful way with novel techniques [2,3]. 

* This work has been partially supported by TIC-2003-08687-C02-01. 
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On the other hand, rewriting systems have been used in databases for data- 
base query optimization, analysis of binding propagation in deductive databases 
[4], and for proposing a new formal semantics for active databases [5]. Never- 
theless, we have not found in the literature any work which uses rewriting logic 
(RL) to tackle an FD problem. FD problems can be classified in two classes 
according to one dimension: their abstraction level. So, we have instance prob- 
lems (for example the extraction of all the FD which are satisfied in a given 
instance relation) and schema problems (for example the construction of all the 
FDs which are inferred from a given set of FDs). The first kind of problems are 
being faced successfully with Artificial Intelligence techniques. Schema problems 
seem to be suitable to be treated with RL. 

There are some authors who introduce several FDs logics [6-8]. All of these 
logics are cast in the same mold. In fact, they are strongly based on Armstrong’s 
Axioms [6] , a set of expressions which illustrates the semantics of FDs. These FD 
logics cited above were created to formally specify FDs and as a metatheoretical 
tool to prove FD properties. Unfortunately, all of these FD axiomatic systems 
have a common heart: the transitivity rule. The strong dependence with respect 
to the transitivity inference rule avoids its executable implementation into RL. 

The most famous problem concerning FDs is the Implication Problem: we 
have a set of FDs r and we would like to prove if a given FD can be deduced 
from r using the axiomatic system. If we incorporate any of these axiomatic 
systems into RL, the exhaustive use of the inference rule would make this rewrite 
system unapplicable, even for trivial FD sets. This limitation caused that a set 
of indirect methods with polinomial cost were created to solve the Implication 
Problem. Furthermore, other well-known FD problems are also tackled with 
indirect methods [9]. 

As the authors says about Maude in [10] : “The same reasons that make it a 
good semantic framework at the computational level make it also a good logical 
framework at the logical level, that is, a metalogic in which many other logics can 
be naturally represented and implemented”. To do that, we need a new FD logic, 
which does not have the transitivity rule in its axiomatic system. Such a logic 
was presented in [11] and we named it the Functional Dependencies Logic with 
Substitution (SL FD ). In this work we use for the first time RL to manage FDs. 
Particularly, we apply Maude as a metalogical framework for representing FD 
logics illustrating that “Maude can be used to create executable environments 
for different logics” [10]. 

The main novelty of SL F£) is the replacement of the transitivity rule by an- 
other inference rule, named Substitution rule 1 with a non-explosive behavior. 
This rule preserves equivalence and reduce the complexity of the original ex- 
pression in linear time. Substitution rule allows the design of a new kind of FD 
logic with a sound and complete inference system. These characteristics allow 
the development of some FD preprocessing transformations which improve the 
use of indirect methods (see [12]) and open the door to the development of a 
future automated theorem prover for FDs. 

1 We would like to remark that our novel rule does not appear in the literature either 
like a primitive rule nor a derived rule. 



